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(57) Abstract 

The present invention discloses steroid/thyroid hormone receptor DNA binding domain compositions that determine tar 
get gene sperificity. Hie invention further discloses methods converting the target gene specificity of one receptor into the target 
gene specificity of another. Still further the invention discloses novel assays for identifying ligandsf or orphan hormone receptors. 
These assays are especially useful since they avoid the necessity of constructing chimeric genes and proteins in order to search for 
ligaads that can activate a putative receptor. 
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HORMONE RESPONSE ELEMENT 
COMPOSITIONS AND ASSAY 

FIELD OF THE INVENTION 
5 The present invention relates generally to 

hormone receptor genes and proteins. More particularly , 
the present invention relates to identification and 
characterization of determinants that govern target gene 
specificity for the superfamily of steroid/thyroid 

10 hormone receptors. In addition the invention relates to 
novel assays for identifying ligands for "orphan" 
hormone receptors. 

BACKGROUND OF THE INVENTION 
The steroid/thyroid hormone receptors form a 

15 superfamily of ligand-dependent transcription factors 

that influence cell function and fate in eukaryotes . It 
is known that these receptors transduce extracellular 
hormonal signals to target genes that contain specific 
enhancer sequences referred to as hormone-response 

20 elements (HREs) (Evans 1988; Green and Chambon 1988). 
It is also known that each receptor recognizes its own 
HREs, thus assuring that a distinct response is 
triggered by different hormones. 

sequence comparisons and mutational analyses of 

25 glucocorticoid receptor (GR) have identified functional 
domains responsible for transcriptional activation and 
repression, nuclear localization, DNA binding, and 
hormone binding (Gigure, et al., 1986; Hollenberg, et 
al., 1987; Rusconi, et al., 1987; Picard and Yamamoto, 

30 1987; Hollenberg and Evans, 1988; Oro, et al., 1988a). 
The DNA binding domain, which is required to activate 
transcription, consists of 66-68 amino acids of which 
about 20 sites, including nine cysteines (C 1 to C 9 ) , are 
invariant among different receptors (Fig. la) . The 

35 modular structure of members of this receptor super- 
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family allows the exchange of one domain for another to 
create functional chimera. This strategy was used to 
dem nstrate that the DNA binding domain is solely 
responsible for the specific recognition of the HRE in 
5 vivo (Green and Chambon, 1987; Giguere, et al./ 1987; 
Petkovich, et al., 1987; Kumar, et al., 1987; Umesono, 
et al., 1988; Thompson, et al., 1989) and in vitro 
(Kumar and Chambon, 1988) . 

By analogy with the proposed structure for 
10 Xenopus transcription factor TFIIIA (Miller, et al., 

1985) , the invariant cysteines are thought to form two 
"zinc fingers" for specific DNA binding. In a poly- 
peptide encompassing the DNA binding domain of rat GR, 
it has been shown that each of two Zn(II) is coordinated 
15 in a tetrahedral arrangement by four cysteines 
(Freedman, et al., 1988). Involvement of these 
cysteines in Zn(II) coordination is also supported by 
the fact that eight out of nine cysteines, enough to 
chelate two Zn(II) , are required for the receptor 
20 function revealed by point mutagenesis experiments 
(Hollenberg, et al., 1988; Severne, et al., 1988). 

Functional models have been proposed in an 
attempt to coordinate research results with gene 
regulation mechanisms that function in vivo. As those 
25 skilled in the art will know, one such model for the DNA 
binding domain is "zinc finger model". (A predicted 
"finger" structure is presented in Fig. lb; also see 
Severne, et al., 1988.) In this model, the first four 
cysteines (C, to C 4 ) chelate one Zn(II) to form Finger 1, 
30 which includes a loop of 13 amino acids between and 

Cj. Finger 2 is formed by the next four cysteines (Cj to 
C 8 ) , since function is retained when the ninth cysteine 
is changed to an alanine or serine (Severne, et al., 
1988). Finger 2 has a loop of 9 amino acids, and is 
35 separated by a "Linker" of 15-17 amino acids from Finger 
1. Both fingers are functionally required becaus 



As the present invention discloses, functional 

20 characterization of mutant receptors carrying chimeric 
DNA binding domains has made it possible to dissect 
molecular determinants of target gene specificity for 
GR, ER, and TR. For example, as th present invention 
discloses, the identity of GR DNA binding domain can be 

25 converted into those of ER and TR by changing three and 
eight amino acids, respectively . The present invention 
also discloses that a single Gly to Glu change in the 
first M zinc finger" of the GR receptor produces a 
receptor with dual HRE (i.e., GRE and ERE) sequence 

30 responsiveness. These discoveries localize two 

structural determinants of target gene specificity and 
suggest a simple pathway for the co-evolution of 
receptor DNA binding domains and regulatory networks. 
These discoveries also make it possible to 

35 convert one receptor into another, and to create 

engineered receptors that have desired HRE recognition 
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features. They have also enabled the development of 
assays that are useful for identifying ligands for 
"orphan" hormone receptors. Such assays are especially 
advantageous because they eliminate the necessity of 
5 constructing chimeric genes and proteins in order to 
search for ligands that can activate the orphan 
receptors . 

BRIEF DESCRIPTION OF THE DRAWINGS 
The following is a brief description of the 
10 drawings. More detailed descriptions are found in the 
section of the specification labeled, "Detailed 
Description of the Drawings". 

FIGURE 1 (a-c) is a schematic drawing. Fig. la^ 
shows a comparison of amino acid sequences among the DNA 
15 binding domains of hGR, hT^, and hER. Fig. lb shows 
predicted zinc fingers based on those for rat GR18. 
Fig. 1c shows the structures of optimized hormone 
response elements for GR (GRE) , ER (ERE) , and T 3 R (TRE) . 
' FIGURE 2 (a and b) is a schematic drawing that 
20 illustrates transactivation of lucif erase reporter 
plasmids by mutant receptors. 

FIGURE 3 is a schematic drawing that illustrates 
identification of two distinct elements specifying the 
TREp+ and ERE+ phenotypes. 
25 FIGURE 4 (a-f) is comprised of the combination 

of a schematic drawing and photographs of blots. Fig. 4 
shows induction of CAT activities by mutant receptors 
from the basal AMTV-CAT (AH) , T 3 -responsive AMTV-TREp-CAT 
(TREp) , an estrogen-responsive AMTV-ERE-CAT (ERE) 
30 reporters. 

DEFINITIONS 

In the present specification and claims, 
referenc will be made to phrases and terms of art which 
are expressly defined for use herein as follows: 
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As used herein, species are identified as 
follows: h, human; r, rat; m, mouse; c, chicken; and d, 
Drosophilia. 

As used herein, M st roid hormone super family of 
5 receptors" refers to the class of related receptors 
comprised of glucocorticoid, mineralocorticoid, 
progesterone, estrogen, estrogen-related, vitamin Dj, 
thyroid, v-erb-A, retinoic acid and E75 (Drosophilia) 
receptors. See Evans (1988) and the references cited 
10 therein. 

As used herein, GR means glucocorticoid 
receptor. The DNA referred to as hGR codes for human 
glucocorticoid receptor GR. hGR is encoded by plasmid 
pRShGR which has been deposited for patent purposes and' 

15 accorded ATCC No. 67200. 

As used herein, MR means mineralocorticoid 
receptor. The DNA referred to as hHR codes for human 
mineralocorticoid receptor MR. hMR is encoded by 
plasmid pRShMR which has been deposited for patent 

20 purposes and accorded ATCC No. 67201. 

As used herein, TR means thyroid receptor and T 3 R 
means triiodothyronine (T 3 ) receptor. TRa and TRfi refer 
to the alpha and beta forms of the thyroid receptor. 
Plasmid pherb-A 8.7 encodes hTRa; it has been deposited 

25 for patent purposes and accorded ATCC No. 40374. 

Plasmid peAlOl encodes hTR/9 ; it has been deposited for 
patent purposes and accorded ATCC No. 67244. 

As used herein, ERR means estrogen-related 
receptor. The acronyms, hERRl and hERR2 refer to human 

30 estrogen-related receptors 1 and 2. These receptors are 
more related to steroid receptors than to the thyroid 
receptors, yet they do not bind any of the major classes 
of known steroid hormones (Giguere, et al., 1988). 
hERRl is encoded by plasmids pE4 and pHKA, which hav 

35 been deposited for patent purposes and accorded ATCC No. 
67309 and 67310, respectively. (Neither pE4 or pHKA are 
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complet clones; hEKRl is constructed by joining 
segments from both clon s.) hERR2 is encoded by plasmid 
phH3 which has been deposited for patent purposes and 
accorded ATCC No. 40373. 
5 As used herein , EAR means retinoic acid 

receptor. The acronym, hRARor, refers to human retinoic 
acid receptor alpha. hRARa is encoded by plasmid phRARa 
which has been deposited for patent purposes and 
accorded ATCC No. 40392. 
10 As used herein , VDR means vitamin Dj receptor. 

As used herein, an "orphan" receptor is a 
protein encoded by a DNA sequence that is homologous 
with DNA sequences that encode known members of the 
steroid/thyroid superfamily of hormone receptors. The 
15 term "orhpan" denotes the fact that the ligand(s) which 
bind to the putative receptor are not yet known. 

As used herein, element P means one of two 
clusters of amino acids in the steroid/thyroid receptors 
that can differentially control target gene specificity 
20 of the DNA binding domains. The cluster of amino acids 
that comprise element P for the GR subfamily (GR, MR, 
PR, AR) is "GSCKV"; the clusters that comprise element P 
for the ER subfamily (T 3 Ra, T 3 a, RARa, RAR0, VDjR, NGFIB, 
TRj, v-erbA, ear2, ear3, Knirps, Knirps-related , ERR1, 
25 ERR2) are "EGCKA", "EGCKG" , "EGCKS" , and "EACKA" . Also 
see Table 2. 

As used herein, element D means one of two 
clusters of amino acids in the steroid/thyroid receptors 
that can differentially control target gene specificity 
30 of the DNA binding domains. The clusters of amino acids 
that comprise element D for the GR subfamily (GR, MR, 
PR, AR) are "AGRND" and "ASRND"; the clusters that 
compris elem nt P for the ER subfamily (T 3 Ra, T 3 o, RARa, 
RAR/J, VDjR, NGFI-B, TP^, v-erbA, ar2, ear3, Knirps, 
35 Knirps-related, ERR1, ERR2) are "PATNQ" , "KYDSC", 

"KYEGK", "HRDKN", "PFNGD", "LANKD" , "RGSKD", "TYDGC", 
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M RSNRD", "RANRN 11 , "KNEGK", "KNNGE" , "PASNE" , and 
H PATNE". Also see Table 2. 

As used herein, CAT means chloramphenicol ace- 
tyitransf erase . 
5 As used herein, lucif erase means firefly 

luciferase. See, de Wet, et a!., (1987). 

As used herein, COS means monkey kidney cells 
which express T antigen (Tag). See Gluzman, Cell, 
23:175 (1981). As used herein, CV-1 means mouse kidney 

10 cells from the cell line referred to as "CV-1" . CV-1 is 
the parental line of COS. Unlike COS cells, which have 
been transformed to express SV40 T antigen (Tag) , CV-1 
cells do not express T antigen. CV-1 cells are 
receptor-deficient cells that are also useful in the 

15 assays of the present invention. 

As used herein, HRE means hormone response 
element. HREs are short cis-acting sequences (about 20 
bp in size) that are required for hormonal (or ligand) 
activation of transcription. The attachment of these 

20 elements to an otherwise hormone-nonresponsive gene 
causes that gene to become hormone responsive. HREs 
function in a position- and orientation-independent 
fashion. Unlike other enhancers, the activity of the 
HREs is dependent upon the presence or absence of 

25 ligand. See Evans (1988) and the references cited 
therein. 

As used herein, engineered HREs refer to HREs 
that have been recombinantly produced using genetic 
engineering techniques such as nucleotide substitution, 

30 deletion, etc. If wild-type, engineered or synthetic 
HREs are linked to hormone-nonresponsive promoters, 
these promoters become hormone responsive. See Evans 
(1988) and the references cited therein. 

As used herein, synthetic HREs refer to HREs 

35 that have been synthesized in vitro using automated 

nucl otide synthesis machines. Since the HREs are only 
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about 20 bp in size, they are easily synthesized in this 
manner. If wild-type, engineered or synthetic HREs are 
linked to hormone-nonresponsive promoters , these 
promoters become hormone responsive. See Evans (1988) 
5 and the references cited therein. 

As used herein, the acronym GRE means gluco- 
corticoid response element and TRE means thyroid 
receptor response element. GREs are hormone response 
elements that confer glucocorticoid responsiveness via 
10 interaction with the GR. See Payvar, et al., Cell, 

35:381 (1983) and Schiedereit, et al., Nature, 304:749 
(1983). GREs can be used with any wild-type or chimeric 
receptor whose DNA-binding domain can functionally bind- 
(i.e., activate) with the GRE. For example, since GR, 
15 MR and PR receptors can all activate GREs, a GRE can be 
used with any wild-type or chimeric receptor that has a 
GR, MR or PR-type DNA-binding domain. TREs are similar 
to GREs except that they confer thyroid hormone 
responsiveness via interaction with TR. TREs can be 
20 used with any wild-type or chimeric receptor whose DNA- 
binding domain can functionally bind (i.e., activate) 
with the TRE. Both TR and RR receptors can activate 
TREs, so a TRE can be used with any receptor that has a 
TR or RR-type DNA-binding domain. 
25 As used herein, ligand means an inducer, such as 

a hormone or growth substance. Inside a cell the ligand 
binds to a receptor protein, thereby creating a 
ligand/receptor complex, which in turn can bind to an 
appropriate hormone response element. Single ligands 
30 may have multiple receptors. For example, both the T 3 Ra 
and the T 3 R£ bind thyroid hormone such as T 3 . 

As used herein, the word "operative", in the 
phrase ^operativ hormon response element f fictionally 
linked to a ligand-responsive promoter and an operative 
35 reporter gene", means that the respectiv DNA sequences 
(represented by the t rms "hormone response element"/ 
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"ligand-responsive promoter" and "reporter gene") are 
operational, i.e., the hormone respons elem nt can bind 
with the DNA-binding domain of receptor prot in (either 
wild-type or chimeric) , the ligand-responsive promoter 
5 can control transcription of the reporter gene (upon 
appropriate activation by a HRE/receptor protein/ligand 
complex) and the reporter gene is capable of being 
expressed in the host cell. The phrase "functionally 
linked" means that when the DNA segments are joined, 

10 upon appropriate activation, the reporter gene (e.g., 
CAT or lucif erase) will be expressed. This expression 
occurs as the result of the fact that the "ligand 
responsive promoter" (which is downstream from the 
hormone response element, and "activated" when the HRE 

15 binds to an appropriate ligand/receptor protein complex, 
and which, in turn then "controls" transcription of the 
reporter gene) was "turned on" or otherwise activated as 
a result of the binding of a ligand/receptor protein 
complex to the hormone response element. 

20 As used herein, the phrase "DNA-binding domain" 

of receptors refers to those portions of the receptor 
proteins (such as glucocorticoid receptor, thyroid 
receptor, mineral ocorticoid receptor, estrogen-related 
receptor and retinoic acid receptor) that bind to HRE 

25 sites on the chromatin DNA. The boundaries for these 
DNA-binding domains have been identified and 
characterized for the steroid hormone superfamily. See 
Evans, (1988); also see Giguere, et al., (1986); 
Hollenberg, et al., (1987) ; Green and Chambon (1987); 

30 Miesfield, etal., (1987); and Evans (1988). 

The DNA-binding domains of the steroid hormone 
superfamily of receptors consist of an amino segment 
varying between 66 to 68 amino acids in length. This 
segment contains 9 cysteine residues, one of which is 

35 the first amino acid of the segment. This first Cys 

residu begins a motif, described as Cys-Xj-Cys-X^.^-Cys- 
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X^Cys, where X is any amino acid residue. The DNA- 
binding domain invariably nds with the amino acids Gly- 
Met. 

As used herein, the phrase "ligand-binding 
5 domain region" of receptors refers to those portions of 
the receptor proteins that bind to ligands such as 
growth substances or the hormones. These boundaries of 
the ligand-binding domains for the steroid receptor 
superfamily have been identified and characterized. 
10 See Evans (1988). 

As used herein, "mutating 11 means using genetic 
engineering techniques to alter DNA so that it is 
different from the "wild- type" or unmodified sequences.- 
Useful genetic engineering techniques for altering the 
15 DNA include, but are not limited to, insertion of new 
nucleotides into wild-type sequences , deletion of 
nucleotides from wild-type sequences, and substitution 
of nucleotides in the wild-type sequences, for example 
by site' directed mutagenesis. 
20 As used herein, "mutant" DNA of the invention 

refers to DNA that has been genetically engineered to be 
different from the "wild-type" or unmodified sequences. 
Such genetic engineering includes insertion of new 
nucleotides into wild-type sequences, deletion of 
25 nucleotides from wild-type sequences, or substitution of 
nucleotides in the wild-type sequences, for example by 
site directed mutagenesis. 

Use of the term "substantial sequence homology" 
in the present specification and claims means it is 
30 intended that DNA, RNA, or amino acid sequences which 
have slight and non-consequential sequence variations 
from the actual sequences disclosed and claimed herein 
ar within th scope f the appended claims. In this 
regard, the "slight and non-consequ ntial" sequence 
35 variations mean that the homologous sequences will be 
functionally equivalent to the sequences of the 
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invention, i.e., the homologous sequences function in 
substantially the same manner to produce substantially 
the same compositions as the nucleic acid and amino acid 
compositions disclosed and claimed herein. 
5 As used herein, the term "recombinantly 

produced" means made using genetic engineering 
techniques, not merely purified from nature. 

The amino acids which comprise the various amino 
acid sequences appearing herein may be identified 
10 according to the following three-letter or one-letter 
abbreviations: 

Three-Letter One Letter 

Amino Acid Abbreviation Abbreviation 





L - Alanine 


Ala 


A 


15 


L - Arginine 


Arg 


R 




L — Asparagine 


. asti 


N 




L - Aspartic Acid 


Asp 


D 




L - Cysteine 


Cys 


C 




L - Glutamine 


Gin 


Q 


20 


L - Glutamic Acid 


Glu 


E 




L - Glycine 


Gly 


G 




L - Histidine 


His 


H 




L - Isoleucine 


lie 


I 




L - Leucine 


Leu 


L 


25 


L - Lysine 


Lys 


K 




L - Methionine 


Met 


M 




L - Phenylalanine 


Phe 


F 




L - Proline 


Pro 


P 




L - Serine 


Ser 


S 


30 


L - Threonine 


Thr 


T 




L - Tryptophan 


Trp 


W 




L - Tyrosine 


Tyr 


Y 




L - Valine 


Val 


V 



35 Th nucl otides which comprise th various 

nucleotide sequences appearing herein have their 



I 
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usual single-letter designations (A, G, T, C or U) 
used routinely in the art. 

As used herein, bp means base pairs and kb 
means kilobase pairs. 
5 In the present specification and claims , the 

Greek letters alpha (a), beta (0) , gamma (7)/ etc., 
are sometimes referred to as a, b, g, etc. 

In the present specification and claims, 
unless noted otherwise, temperatures are in degrees 
10 centigrade. 

DEPOSITS 

Plasmids pRShGR (hGR) , pRShMR (hMR) , peAlOl 
(Ht^) and GMOftT, all of which are in E. coll HB101, 
plus plasmids pE4 and pHKA (which together encode 

15 hERRl) , phH3 (hERR2) , pherJb-A 8.7 (hTRa, phFA 8 (a 
partial clone of hTRa, and plasmid phRARa have been 
deposited at the American Type Culture Collection, 
Rockville, Maryland, U.S.A. (ATCC) under the terms of 
the Budapest Treaty on the International Recognition 

20 of Deposits of Microorganisms for Purposes of Patent 
Procedure and the Regulations promulgated under this 
Treaty. Samples of the plasmids are and will be 
available to industrial property offices and other 
persons legally entitled to receive them under the 

25 terms of said Treaty and Regulations and otherwise in 
compliance with the patent laws and regulations of 
the United States of America and all other nations or 
international organizations in which this 
application, or an application claiming priority of 

30 this application, is filed or in which any patent 
granted on any such application is granted. 

The ATCC Deposit Numbers and Deposit Dates 
for the deposits ar as follows: 

pRShGR (hGR) 67200 Sept. 9, 1986 

35 pRShMR (hMR) 67201 Sept. 9, 1986 

pE4 (hERRl*) 67309 Jan. 30, 1987 
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phHKA (hERRl*) 


67310 


Jan. 


30, 


1987 


phH3 (hERR2) 


40373 


Sept. 


29, 


1987 


GKCAT (reporter) 


67282 


Dec. 


18, 


1986 


tsherb-A 8.7 ChTRa) 


40374 


Sent 


/ 


1QQ7 


peAlOl (hTR0) 


67244 


Oct. 


22, 


1986 


phRARa (hRARa) 


40392 


Nov. 


20, 


1987 



(* means a partial clone) 



(pE4 & phHKA together encode complete hERRl) 

DESCRIPTION OF TH"5 TNVENTTON 

10 in one aspect, the present invention comprises 

steroid/thyroid hormone receptor DNA binding domain 
compositions that determine target gene specificity. 
The compositions of the invention are selected from the 
group consisting of GSCKV, EGCKA, EGCKG, EGCKS, EACKA, 

15 AGRND, ASRND, PATNQ, KYDSC, KYEGK, HRDKN, PFNGD, LANKD, 
RGSKD, TYDGC, RSNRD, RANRN, KNEGK, KNNGE, PASNE, and 
PATNE. 

In another aspect, the present invention is a 
cluster of amino acids comprised of EGxxGxxR where x is 
20 selected from the group consisting of A, R, N, D, C, Q, 
E, H, I, L, K, M, F, P, S, T, W, Y, and V. 

In still another aspect, the present invention 
is a cluster of amino acids comprised of KVEGK. 

In still another aspect, the present invention 
25 is comprised of mutant receptors selected from the 
group consisting of GTG8 , GTG3A, GTG2, GTG1. 

In still another aspect, the invention 
comprises a method for constructing a receptor that can 
activate both GRE and ERE sequences. The method 
30 comprises introducing a point mutation into the gluco- 
corticoid receptor DNA sequence so that the encoded 
glycine at the site between Cj and C 4 is replaced with a 
glutamic acid. Conversely, a point mutation is 
introduced into the estrogen receptor DNA sequenc so 
35 the encoded glutamic acid at the site between Cj and C 4 
is r placed with a glycine. 



WO 90/11273 



PCT/US90/01428 



14 

In still another aspect, the invention 
comprises a substantially pure receptor protein that 
can activate-both glucocorticoid and estrogen response 
element sequences. 
5 In still another aspect , the invention 

comprises methods for converting the target gene 
specificity of one receptor into the target gene 
specificity of another. According to this aspect of 
the invention, target gene specificity of the gluco- 
10 corticoid receptor is converted to that of the estrogen 
receptor by changing three amino acids clustered in the 
first zinc finger. A single Gly to Glu change in this 
region produces a receptor with dual sequence 
responsiveness. Further replacement of five amino 
15 acids in the stem of the second zinc finger transforms 
the specificity to that of the thyroid hormone. 

Still further the invention comprises novel 
assays for identifying functional ligands for orphan 
hormone receptors. These assays are especially useful 
20 since they avoid the necessity of constructing chimeric 
genes and proteins in order to search for ligands that 
can activate an orphan receptor. In addition, the 
assays of the invention do not require that the DNA 
sequences in question be sequenced to see if they 
25 belong to the GR or ER/TR subfamilies (although this 
can be done) • Furthermore, as is done in the preferred 
assay of the invention, the presence of two reporter 
genes in the host cell, one operatively linked to a GRE 
and the other to a ERE, makes it possible to assay all 
30 unknown members of the steroid/ thyroid receptor super- 
family with a single assay system. In addition, a 
cocktail of ligands can be tested at once. If any of 
th ligands activate either of the reporters these 
ligands can be retested, e.g., in smaller "cocktail" 
35 groups, and then separately. This greatly increases 
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the efficiency of the search for ligands for the orphan 
receptors . 

According t one assay aspect of the 
invention, DNA sequences are isolated that are 
5 suspected of encoding receptor proteins. These DNA 
sequences are trans fected into a suitable receptor- 
deficient host cell that has been engineered to contain 
at least one reporter gene functionally linked to at 
least one operative hormone responsive element wherein 

10 the hormone response element (s) is selected from the 
group consisting of wild-type, engineered or synthetic 
glucocorticoid response element and wild-type, 
engineered or synthetic estrogen response element. The 
transfected receptor^deficient host cell (which now 

15 contains the suspected or "orphan" receptor and at 
least one reporter/HRE complex) is challenged with at 
least one candidate ligand(s) that can potentially bind 
with the ligand-binding domain region of the putative 
receptor protein encoded by the DNA sequence in 

20 question. The induction of the reporter gene is 

monitored by means of changes in the protein levels of 
the protein encoded by the reporter gene. Finally a 
selection is made of ligand(s) that is capable of 
inducing production of the protein product of the 

25 reporter gene. 

In preferred assays of the invention, the 
transfected host cell be a CV-1 cell which will contain 
at least two reporter genes, each operatively linked to 
a different functional HRE element. In an especially 

30 preferred form, a first reporter gene will be 
operatively linked to wild-type, engineered or 
synthetic glucocorticoid response element, and a second 
will be operatively linked to wild-type, engineer d or 
synthetic estrogen response el ment. in this form, a 

35 first reporter gene will preferably be chloramphenicol 
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acetyltransf erase (CAT) and the second will preferably 
be firefly lucif erase. 

In another assay aspect of the invention, an 
assay is provided for identifying ligand(s) that 
5 activates an orphan receptor* According to this aspect 
of the invention, receptor-deficient host cells are 
provided with at least one reporter gene functionally 
linked to a preselected hormone response element. In 
addition, at least the DNA encoding the P region of the 

10 DNA binding domain of the orphan receptor is mutated, 
preferably by means of site directed mutagenesis, so 
that the mutated orphan receptor can activate the 
preselected hormone response element in receptor- 
deficient host cells.. The host cells containing the- 

15 preselected hormone response element functionally 

linked to at least one reporter gene are contacted with 
mutated orphan receptor, and then challenged with 
candidate ligand(s) which can potentially bind with the 
ligand-binding domain region of the mutated orphan 

20 receptor. Finally, induction of the reporter gene(s) 
is monitored as an indication of those ligand(s) which 
can activate the orphan receptor. 

The compositions, methods and assays of the 
invention, plus preferred methods for making and using 

25 them, are described more fully in the Examples that 
follow. 

EXAMPLES 
EXAMPLE 1 
Finger Module 

30 Previous studies on hormone receptors have 

shown that amino acids throughout the entire DNA- 
binding domain may be important for DNA-binding 
(Hollenberg, t al., 1987). This example discloses 
which amino acids contribute to sequence rec gnition 

35 and are thus r sponsible for determining target g n 
specificity. Experimentally, it is important to know 
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what changes in the DNA binding domain are necessary, 
at a minimum, to allow the glucocorticoid receptor to 
recognize thyroid hormone and estrogen response 
elements. Between the human glucocorticoid receptor 
(hGR) and the human thyroid receptor Jbe ta (hTR/3) less 
than half of the amino acids in this region are 
conserved (Fig. la) . If the entire hGR DNA binding 
domain is replaced by that from the hTRfi , specificity 
is completely switched such that hybrid GTG activates 
only through the TRE and no longer recognizes the GRE 
(Thompson, et al., 1989). 

In the present experiment mutant GR expression 
plasmids were introduced into monkey kidney CV-1 cells 
together with at least one reporter plasmid (e.g./ one 
of the firefly luciferase reporter plasmids) to test 
its target gene specificity and transactivation 
function. For glucocorticoid response the luciferase 
coding seguence was linked to the responsive MTV-LTR 
(MTV-LUC plasmid) (Hollenberg, et al., 1988). For T 3 - 
responsiveness the GREs were deleted from MTV and 
replaced with an oligonucleotide encoding a palindromic 
TRE to generate reporter AMTV-TRE p -LUC (Umesono, et al., 
1988; Giguere, et al., 1989). Upon addition of a 
synthetic glucocorticoid dexamethasone , parental 
receptors hGRnx and GTG elicited around 2,500 and 200 
fold induction over the background level from MTV-LUC 
and AMTV-TRE p -LUC , respectively (Fig. 2). These 
inductions are clearly mediated by the cognate response 
elements. 

EXAMPLE 2 
Receptor Identity; TRE Recognition 
To identify amino acids that allow the 
receptors to discriminate between their cognate HREs, a 
vari ty of chimeric DNA binding domains were 
constructed. In the mutants presented in Figure 2, 
portions of the loops and linkers were exchanged 
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between the hGR and hTR£ or human retinoic acid 
receptor alpha (hRARa) • These switches of Loop 1 (GTG7 
and GTG32) , most parts of the Linker (GTG38 , GRGS , 
GTG36B, and GTG29) , and Loop 2 and adjacent downstream 
5 portions (GTG6, GTG33, and GTG28) . All of these switch 
mutants are active (88% to 7% of parental response) 
upon induction of a lucif erase activity. 

The interchangeable nature of the loops is 
remarkable in terms of dramatic changes in charge 

10 distribution with no change in target gene specificity. 
Furthermore , both small and large substitutions in the 
Linker regions are quite forgiving with no impact on 
promoter specificity. Reducing or expanding the 
spacing between the fingers from 15 (GRnx and GTG29) , 

15 17 (GTG) , 19 (GRsst) and 21 (GRsstG) is again well 
tolerated. From phenotypes of these mutants, it is 
apparent that differences in the primary structure or 
size found in these regions are mutually permissive 
even between two of the most distantly related members 

20 of the receptor super family. Furthermore, since none 
of the Loop or Linker mutants described here has 
changed specificity, it is concluded that these amino 
acids apparently are not critical in discriminating 
between GREs and TREs or TRE p from GREs, but rather are 

25 involved in common aspects of HRE recognition and 
transact ivation . 

The modular or interchangeable nature of large 
segments of the finger regions does not belie the 
fundamental fact that there must be sequences that 

30 determine specificity. By elimination from this series 
of mutants, nine amino acids in two clusters were not 
changed and may therefore impart specificity. The 
first proximal (P) region follows Cj and includes the 
three amin acid cluster GS — V in the GR and EG — G in 

35 the TR. These are part of a 10 amino acid segment that 
is highly cons rved and includes 6 invariant amino 
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acids. When these three amino acids are switched (see 
6T63A in Fig. 2a and GTG36A in Pig. 2b) , the resultant 
chimeras are nonfunctional on either MTV-LUC or AMTV- 
PRE p -LUC. The second or distal (D) region of potential 
5 importance in specificity is a cluster of five amino 
acids between C s and C 6 ; AGRND in hGR and KYEGK in hTR0. 

Indeed, elimination of these amino acids from 
GTG21 to produce GTG15 yields a TRE inactive phenotype 
(Fig. 3). Based on these observations the critical 

10 double switch and mutant GTG8 (Fig. 3) was constructed 
which carries an almost complete hGR DNA-binding domain 
with only 8 hTR0-specific amino acids. Remarkably, 
this chimera induces a full conversion of target gene 
specificity (Figs 3 and 4H) . The induction of 

15 luciferase activity on the TRE p reporter by GTG8 is as 
efficient as GTG itself. 

Thus two noncontiguous "elements" (P and D) 
have been localized in the hTR/J DNA-binding domain that 
are sufficient to impart TRE-specific recognition in 

20 the context of an otherwise normal glucocorticoid 
receptor. Both elements are necessary while neither 
alone is sufficient. Furthermore, acquisition of the 
TRE specificity in this change includes a complete loss 
of GRE responsiveness. 

25 EXAMPLE 3 

Receptor Identity; ERE Recognition 
Although all HREs contain a recognizable 
conserved motif , the TREp and the palindromic 
vitellogenin ERE (EREp) differ only in the spacing of 

30 what are otherwise identical half-sites (5'-TGACC-3' , 
Fig. lc) . Furthermore, recent studies indicate that 
the endogenous T 3 R (Glass, et al., 1988) as well as the 
in vitro translated product of cloned a and p TR cDNA 
(Thompson t al., 1989) can bind with high affinity to 

35 both EREp as well as TREp. Despite this relationship, 
th possibility that the T 3 R might mediate an ERE- 
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dep ndent transactivation has not been carefully 
evaluated. To address this issue an oligonucleotide 
encoding the ERE was inserted into the basal AMTV 
promoter (Hollenberg, et al., (1988)) to make reporter 
5 plasmids AMTV-ERE-LUC and AMTV-ERE-CAT. 

This promoter is indeed estradiol (E 2 ) 
infusible because in CV-1 cells an efficient 
stimulation of CAT activity is dependent on both 
functional ER and the ligand E 2 (Fig. 4d) . The 

10 transfected HER fails to activate through the GRE 

(Umesono and Evans , unpublished data) but does sustain 
a weak activator of the TREp reporter. However , in the 
reciprocal experiment the GTG chimera promotes activity ^ 
of the ERE reporter (Figs 3a and 4d) . In contrast , hGR 

15 does not induce CAT or luciferase activity from a EREp 
or TREp receptor (Figs 3a and 4b). Thus, the DNA 
binding domain of hT 3 R0 is able to recognize both the 
TREp and ERE as response elements in this assay system. 

Because of this close relationship between the 

20 hER and hTR mutant receptors were tested for combined 
ERE and TRE responsiveness to evaluate functional roles 
for the two specificity regions. Taking the activity 
of GTG as 100% , all of the TRE+ mutants in Figures 2 
and 3 (GTxbaG, GTsstG, GTG32, GTG36B, GTG29, GTG28 , 

25 GTG21, and GTG8) showed 70% to 650% activity upon the 
ERE reporter (Fig. 3). Furthermore, mutants carrying 
hTR/? element P but lacking element D (GTG16 and GTG3A) 
also showed significant inductions upon ERE even though 
they were inactive on the TRE p (Figs 3 and 4). These 

30 results demonstrate that hT 3 R0 element P, not D, 

determines a positive phenotype upon the common TGACC- 
type half sites found in ERE arid TRE p . Consistently all 
the mutants containing GR element P (HGRNX, GTG7, 
GTG35, GRG8, GTG6, GTG36A, GTG5) are not responsiv to 

35 the ERE reporter (data not shown) . 
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The phenotype of GT63A (Figs 3c and 4g) has 
clearly revealed that replacement of three amino acids, 
forming element P, is sufficient to convert the 
identity of hGR DNA binding domain (GRE*, ERE", TRE") 

5 into that of ER (GRE", ERE*, TRE") . It is noteworthy 
that this is a complete conversion; that is, this 
mutant is negative upon MTV and only positive upon ERE. 
The ERE responsiveness of mutant GTG3A is not 
restricted to the modified MTV promoter. For example, 

0 ERE-TK-LUC reporter is also regulated by GTG3A while 
the parental TK-LUC that lacks an oligonucleotide 
encoding ERE is not induced (unpublished observations) . 

EXAMPLE & 

Receptor Identity; Dual Specificity 
5 The data presented herein have shown that 

element P, located in the base of Finger 1 and part of 
the adjacent Linker specifies the GRE— ERE responsive- 
ness. This element includes three amino acid dif- 
ferences between hGR (GSCKV) and hT 3 R/5. Interestingly 
0 hTjR/J and hER (EGCKA) , both of which are active upon 
ERE, share almost identical structures when it comes to 
this element (Fig. la) , indicating that these amino 
acid sequences are important for the recognition of 
ERE. 

5 One trend of these results is that by 

imparting a new specificity upon a DNA-binding domain, 
the previous binding specificity is simultaneously 
lost. To determine whether individual amino acids play 
different roles with regard to acquisition or 

3 suppression of HRE specificity, the activity of single 
paired and triple mutants was examined. Although the 
triple mutant (GTG3A) selectively recognizes the ERE, . 
the double mutant (GTG2) is active upon both MTV and 
ERE r port rs (Fig. 3c and 4f ) . This induction is 

> dependent on the HRE since this receptor fails to 
activate eith r AMTV or TREp promoter (Fig. 4). An 
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examination of the phenotype of this mutant indicates 
that the last glycine in the T 3 R£ P element seems to be 
involved in a suppression of the GRE*- without a 
substantial impact on ERE recognition. 
5 Based on this result the question was asked 

whether a single amino acid substitution would be 
sufficient to impart a recognizable change on receptor 
phenotype. Remarkably, by substituting the gluco- 
corticoid receptor glycine or an estrogen receptor 

10 glutamic acid between Cj and C 4 (GT61) a receptor with 
dual specif icity is produced. This single amino acid 
change leaves GRE recognition normal but fosters clear 
recognition of the ERE. As with GTG2 mutant, these 
inductions are dependent on either GREs or EREs (Fig-. 

15 3c and 4e) . However, because all of these mutants lack 
the T 3 R0 D element, they are completely inactive on the 
TREp. 

EXAMPLE 5 
The P and D Elements 

20 As those skilled in the art will appreciate, 

the results presented in the previous Examples provide 
a number of unexpected conclusions concerning the 
mechanisms by which the nuclear receptors identify 
their appropriate response elements. First, the 

25 putative loops of the two zinc fingers can be exchanged 
between receptors without altering specificity. 
Second, two distinct regions, p and D, outside the 
loops, are critical for sequence recognition. Third, 
different amino acids may be involved in acquisition 

30 and loss of HRE responsiveness. Because of this, it 
has been possible via a single amino acid change to 
create a receptor with dual GRE and ERE recognition. 

It is important to understand that thes 
results do not imply that the amino acids in the loops 

35 are not critical for sequence specific binding; rather 
our results show that the amino acids in the loops are 
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not critical in identifying the differences between 
response elements. Thus, it may b presumed that the 
commonalty of loop function indicates that they mediate 
recognition of common aspects of HRE structure. In 
5 contrast, the P and D elements would be involved in 
recognizing the variant nucleotides. This is 
consistent with the observation that HREs are composed 
of a common core sequence that can change in only a 
restricted number of nucleotide positions. This would 

10 suggest a putative topographical alignment between the 
HRE and the zinc finger such that the variable region 
of the HRE would always align with the P element of the 
finger. It can therefore be presumed that amino acids 
in this element make specific contacts with at least 

15 one of the variable positions in the pentomeric HRE. 

In addition to primary sequence recognition 
there is a need to discriminate half-site spacing (ERE 
vs. TRE; Fig. lc) . As summarized in Table 1, our 
results indicate that the first element (P) specifies 

20 the primary nucleotide sequence of the half sites, 
while the second element (D) is important for the 
determination of the half -site spacing, in other words 
the second element in the hGR and hER restricts 
recognition to HREs with three gap nucleotides. 

25 In respect to the amino acid sequence of the 

first element, all members of this receptor superfamily 
can be classified into either GR or ER subfamily (Table 
2) . The GR subfamily includes four members (GR, MR, 
PR, and AR (Evans, 1988) ; Lubahn, et al., 1988); Ham, 

30 et al., 1988), and all of them are able to recognize 
GREs, although physiological effects of each hormone 
are quite different (Ham, et al., 1988). The other 
members constitute the ER/TR subfamily (ER, T 3 Ra, T 3 R0, 
RARa, RAR0, VDjR, NGFI-B (Milbrandt, 1988), TR-2 (Chang, 

35 et al., 1988), v-erbA, ear2 (Miyajima, et al., 1988), 
ear3 (Miyajima, t al., 1988), knirps (Nauber, et al., 



WO 90/11273 



PCT/US90/G1428 



24 

1988), knirps-related (Oro, et al., 1988b), ERR1 and 
ERR2 (for reference, see Evans, 1988) with a slight 
variation. The members of this subfamily may recognize 
ERE, TREp, or a palindromic pair of TGACC sequences 
5 separated by five or more gap nucleotides because of 
the critical glutamic acid residue following Cj. In 
support of this model, our group at the Salic Institute 
for Biological Studies has shown that retinoic acid 
receptor can bind to and activate transcription through 
10 TREp. 

Compared to the structural conservation of the 
first element, the second element is quite divergent. 
According to our hypothesis the second element need not - 
bind directly to DNA but rather may determine spatial 

15 configuration of the half-sites via protein-protein 
interaction. One possibility is that it may represent 
a dimerization interface. This assumption is 
consistent with recent reports that the receptor DNA 
binding domain by itself seems to contain a 

20 dimerization signal (Kumar, et al., 1988; Tsai, et al., 
1988) . 
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TABLE 1 



5 





Phenotypes 


of the Wild- 


-Type 




and Mutant DNA Binding Domains 


element: P 


element D 


GRE 


ERE TREp 


GR 


GR 


+ 




T 3 R£ 


T 3 R^ 




+ + 


GR 


T 3 R/3 


(+) 




T 3 R/3 


GR 




+ 


ER 


ER 




+ 


GR 


ER 


+ 




ER 


GR 




+ 



20 The phenotypes of GR-ER hybrids are from Green, et al., 
(1988) . (+) indicates that a positive phenotype is 
dependent on the construct. 
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TABLE 2 



Structure. .of Element P and Element D 
in GR and ER/TR Subfamilies 



Receptor 



Element P 



Element D 



a) GR Subfamily 
10 GR, MR, PR, AR 



GSCKV 
GSCKV 



AGRND 
AS END 



15 



20 



b) ER Subfamily 
ER 

T 3 Ra 
T 3 P^ 

RARa, RAR/9 
VDjR 
NGFI-B 
TR2 



EGCKA 

EGCKG 
EGCKG 
EGCKG 
EGCKG 
EGCKG 
EGCKG 



PATNQ 

KYDSC 
KYEGK 
HRDKN 
PFNGD 
LANKD 
RGSKD 



V-erbA 
ear2 
25 ear3 

knlrps 

taiirps-related 



EGCKS 
EGCKS 
EGCKS 
EGCKS 
EGCKS 



TYDGC 
RSNRD 
RANRN 
KNEGK 
KNNGE 



30 



ERR1 
ERR2 



EACKA 
EACKA 



PASNE 
PATNE 
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DETAILED DESCRIPTION OP THE DRAWTNGS 

FIGURE 1. a) Comparison of amino acid 
sequ nces among the DNA binding domains of hGR, hT 3 R0, 
and hER (for reference, see Evans, 1988) . Upper 
5 numbers (1 to 9) correspond to the invariant nine 
cysteines found in the receptor DNA binding domains. 
Number adjacent to the seguences indicate amino-acid 
positions of each receptor. Colons show the identity 
of amino acids between hGR and hT 3 R0 or hT 3 R£ and hER. 

10 The hT 3 R0 DNA binding domain contains two extra amino 
acids in the middle of the domain, b) Predicted zinc 
fingers based on those for rat GR18. Numbers 1 to 9 
for cysteines are as in Fig la. Conserved amino 
acids in hGR, hT 3 R/3, and hER are shown by one letter 

15 amino-acid code. Dots represent variable amino acids 
among these receptors. Two zinc ions (Zn) are 
chelated in a tetrahedral coordination by two 
clusters of four cysteines (C, to c 4 and Cj to C 8 ) , 
forming two "zinc fingers" (Finger 1 and Finger 2) . 

20 The two fingers are separated by 15 (hGR and hER) or 
17 (hT 3 R/3) amino acids consisting of Linker between 
the fingers, c) Structures of optimized hormone 
response elements for GR (GRE) , ER (ERE) , and T 3 R 
(TRE) (See Green and Chambon, 1988; Klock et al., 

25 1987; and Glass et al., 1988). Each arrow indicates 
a "half site" of the HREs. These HREs consist of a 
palindromic pair of the half sites as shown by the 
arrows. Both GRE and ERE contain three gap 
nucleotides (nnn) between the half sites, but no such 

30 nucleotide is found in TREp ( ) . Dots in ERE and 

TREp indicate different nucleotides from those of 
GRE. Accordingly, ERE and TREp contain the same half 
site sequence. 

FIGURE 2. Transactivation of lucif erase 

35 r porter plasmids by mutant receptors, a) The amino 
acid sequence of a parental hGRnx DNA binding domain 
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is presented. The hGRnx contains the wild-type hGR 
DNA binding domain flanked by NotI and Xhol sites in 
the cDNA. Numbers 1 to 9 are as in Fig. la. 
Substituted or inserted amino acids in mutants GRxba 
5 and GRsst are shown, b) Structures of chimeric GR 
DNA binding domains carrying local hT 3 R£ (GTG23, GTG7, 
GTG3A, and GTG6) or hRARa (GRG8) sequences. Changed 
amino acids are presented. Mutants are designated 
after the composition of their DNA binding domains; 
10 GTG23 stands for a mutant GR which has a DNA binding 
domain composed of both hGR and hT 3 Rj3 sequences, and 
contains 23 different amino acids from the wild-type 
hGR sequence, c) The amino acid sequence of the hTjR/? 
DNA binding domain in a hybrid receptor GTG is shown; 
15 The GTG is a hybrid GR whose DNA binding domain has 
been replaced by that of hT 3 R£. As in a) changed 
amino acids are presented for GTxbaG and GTsstG. d) 
Chimeric hT 3 R/3 DNA binding domains carrying hGR 
sequences indicated by one letter amino-acid code. 
20 Each receptor expression plasmid was cotransfected 
into CV-1 cells with either glucocorticoid (MTV-LUC) 
or thyroid hormone (AMTV-TREp-LUC) responsive 
reporter plasmids, and the cells were cultured in the 
absence or presence of lOOnM dexamethasone for 36 
25 hours. Transact ivation function of these receptors 
was judged by the induced luciferase activity from 
the reporter plasmids quantified as percentage of 
HGRNX and GTG activities upon MTV-LUC (MTV) (2,500 
fold) and AMTV-TREp-LUC (TREp) (200 fold), 
30 respectively. The activity below the background 
induction level (6 fold for MTV and two fold for 
TREp) is indicated as -. For the activity of GTG18 
on TREp, th "(2)" indicat.s that the induction is 
not dep ndent on TREp. 
35 Figure 2 Methods. hGRnxlO and GTG14 were 

constructed using known and previously published 
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methods. Mutant cDNAs encoding DNA binding domains 
of GRxba, GTG7, GTG3B, GRG8, GTG6, and GTG3A were 
obtain d through an oligonucleotide-directed 
mutagenesis (Kunkel, et al., 1985) of the Notl-Xhol 
5 DNA fragment encoding the hGRnx DNA binding domain. 
Similarly, those for GTxbaG, GTG32, GTG36B, GTG29, 
GTG33, and GTG36A were made by exchanging the Notl- 
Xhol fragment for the hT 3 Rj9 DNA binding domain. 
Mutants GRsst and GTsstG cDNAs contain a SstI linker 

10 (8-mer, New England Biolabs) at filled-in Xball sites 
of GRxba and GTxbaG, respectively. GTG23 and GTG18 
were obtained by exchanging small Notl-Xbal fragments 
between GRxba and GTxbaG. Each of the Notl-Xhol cDNA 
fragments encoding mutant DNA binding domains was 

15 ligated with a large Notl-Xhol fragment obtained from 
pRShGRnxlO. To make GTG28 , Clal site was introduced 
into hT 3 R0nxl4 DNA binding domain cDNA (corresponding 
to amino acids "ID" in the loop of the Finger 2. 
Because the hGRnx cDNA has the Clal site at this 

20 position, the Notl-CIal fragment of the hGRnx cDNA 
was replaced by that of hT 3 R£nx cDNA. All mutations 
were confirmed by the nucleotide sequencing. 
Construction of reporter plasmids MTV-LUC7 and AMTV- 
TREp-LUC21 was done using previously published 

25 methods. Receptor expression plasmid (1 ng) , 

reporter plasmid (5 ng) , p-galactosidase reference 
plasmid (5 jig) , and carrier plasmid pGEM4 (9 tig) were 
cotransfected. Transfection of CV-1 cells (Umesono, 
et al., 1988) and an assay for the lucif erase 

30 activity were done using published methods (de Wet, 
et al., 1987) except that the reference plasmid 
PRSV0GAL was replaced by pRAS-0GAL which contains a 
human c-Ha-ras promoter (Ishii, et al., 1985) linked 
to the coding sequence of 0-galactosidase in a pUC- 

35 derivativ plasmid (pUCGALpA, kindly provided by Dr. 
Richard J.Rickles) , and phenol red in the culture 



WO 90/11273 



PCT/US90/01428 



30 

media was omitted after introduction of the plasmids 
into the cells. 

FIGURE 3. Identification of two distinct 
elements specifying the TKEp+ and ERE+ phenotypes. 
5 Two parental DNA binding domains are presented by 
bold letters. In GTG identical amino acids to those 
in hGR are indicated by dots. Similarly, only 
different amino acids in mutant DNA binding domains 
from those of hGR are shown. The two regions 
10 (element P and D) are marked by boxes. Nomenclature 
of the mutants and numbers are the same as in Fig. 2. 
Mutant receptors were tested for transactivation 
function in the presence of one of the luciferase 
reporters carrying GREs (MTV-LUC) , TREp (AMTV-TREp- 
15 LUC) , or ERE (AMTV-ERE-LUC) after transient 

expression in CV-1 cells (± lOOnM dexamethasone for 
36 hours). Weak stimulation (2) by GTG15 on TREp is 
due to higher background activity by this mutant ,• and 
the 2% induction is not dependent on TREp. 100% 
20 induction of the luciferase activity by GTG on AMTV- 
ERE-LUC (ERE) is 10 fold, and - indicates no 
induction. Relative activities by TRE+ mutants shown 
in Fig. 2 on ERE reporter are as follows: GTG32 
(90%) , GTG36B (140%) , GTG29 (70%) , GTG33 (380%) , 
25 GTG28 (650%). 

Figure 3 Methods. An estrogen responsive 
reporter plasmid AMTV-ERE-LUC was constructed as 
follows: an oligonucleotide encoding a palindromic 
ERE (5 1 -TCAGGTCACAGTGACCTGA-3 * ) (see Glass, et al., 
30 1988) was inserted into the unique Hlndlll site of 
the AMTV-CAT plasmid, in which major GREs between 
position -190 and -88 of MTV-LTR were deleted; this 
g nerated AMTV-ERE-CAT; the CAT gen was subsequently 
r plac d with the luciferase gene obtained from pSVO- 
35 A/L-A A5' (de Wet, et al., 1987), giving AMTV-ERE- 
LUC. Mutant GTG21 is made from GTG32 by introducing 
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the Clal site as in GTG28 (Fig. 2d) . A small Wotl- 
Clal fragment of hGRnx was exchanged by that of 
GTG32-ClaI. GTG15 and GTG8 were obtained through 
site-directed mutagenesis (Kunkel, et al., 1985) of 
5 the cDNAs encoding GTG21 and GTG3A DNA binding 

domains, respectively. Similarly GTG2 and GTG1 were 
made from hGRnx. Transfection and an assay of the 
lucif erase activity were as in Fig. 2. 

figure 4. Induction of CAT activities by 

10 mutant receptors from the basal AMTV-CAT (AM) , T3- 
responsive AHTV-TREp-CAT (TREp) , and estrogen- 
responsive AMTV-ERE-CAT (ERE) reporters. Together 
with one of the reporters, indicated receptor 
expression plasmids from a) to h) were transfected 

15 into CV-1 cells. The receptors were activated by 
adding lOOnM of dexamethasone (D) or 17/3-estradiol 
(E2) for 36 hours. The "-« symbol indicates that the 
solvent ethanol was added. Structures of hGRnx, GTG, 
GTG8, GTG3A, GTG2, and GTG1 were presented in Figs. 2 

20 and 3. The notation "no receptor" indicates that an 
expression plasmid encoding hT 3 R/3 in the reverse 
orientation was cotransfected. hER; human estrogen 
receptor. From e) to h) , changed amino acids are 
shown in a schematic figure for hGR zinc fingers. 

25 The calculated % conversions in the CAT assays are a) 
AM (-) 0.4, (D) 0.5, TREp (-) 0.4, (D) 0.6, ERE (-) 
0.6, (D) 1.0, b) AM (-) 0.4', (D) 0.7, TREp (-) 0.3, 
(D) 0.8, ERE (-) 0.4, (D) 1.2, c) AM (-) 1.8, (E 2 ) 
1.6, TREp (-) 1.6, (E 2 ) 7.0, ERE (-) 1.8, (E 2 ) 78, d) 

30 AM (-) 1.4, (D), 4.6, TREp (-) 1.1, (D) 74, ERE (-) 
1.9, (D) 72, e) AM (-) 0.5, (D) 1.3, TREp (-) 0.6, 
(D) 1.4, ERE (-) 1.2, (D) 20, f) AM (-) 0.8, (D) 1.8, 
TREp (-) 0.9, (D) 3.5, ERE (-) 1.3, (D) 96, g) AM (-) 
0.5, (D) 0.7, TREp (-) 0.5, (D) 0.6, ERE (-) 1.0, (D) 

35 88, h) AM (-) 0.7, (D) 4.3, TREp (-) 0.7, (D) 87, ERE 
(-) 1.0, (D) 95. 
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Figure 4 Methods. Reporter plasmids AMTV- 
CAT (Hollenberg, et al., 1988), AMTV-TREp -CAT (TRE- 
plMCAT (Umesono, et al., 1988; Thompson and Evans, 
1989) , AMTV-ERE-CAT (see legend for Fig. 3) , and an 

5 expression plasmid for hER35 were previously 

described. CV-1 cell transfection has been carried 
out as described in the legend for Fig. 2 except that 
the luciferase reporters were replaced by the CAT 
reporters. For the CAT assay (Gorman, et al., 1982), 

10 cell extracts corresponding to 30 units of B- 

galactosidase activity (Herbomel, et al., 1984) were 
incubated for 3 hours. Calculation of % conversion 
for CAT activity was carried out as previously 
described (Thompson and Evans, 1989). 
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5 BPRfiTFTCATION SUMMARY 

From the foregoing description, one of 
ordinary skill in the art can understand that the 
present invention discloses steroid/ thyroid hormone 
receptor DNA binding domain compositions that 

10 determine target gene specificity. The invention 
further discloses methods converting the target gene 
specificity of one receptor into the target gene 
specificity of another. Still further the invention 
discloses novel assays for identifying functional 

15 ligands for putative or "orphan" hormone receptors. 
These assays are especially useful since they avoid 
the necessity of constructing chimeric genes and 
proteins in order to search for ligands that can 
activate' an orphan receptor. 

20 Without departing from the spirit and scope of 

this invention, one of ordinary skill can make 
various changes and modifications to the invention to 
adapt it to various usages and conditions. As such, 
these changes and modifications are properly, 

25 equitable, and intended to be, within the full range 
of equivalence of the following claims. 
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0 WHAT IS CLAIMED IS: 

1. Element P amino acids selected from the 
group consisting of GSCKV, EGCKA, EGCKG, EGCKS, and 
EACKA. 

2. Element D amino acids selected from the 
5 group consisting of AGRND, ASRND, PATNQ, KYDSC, 

KYEGK, HRDKN, PFNGD, LANKD, RGSKD, TYDGC, RSNRD, 
RANRN, KNEGK, KNNGE, PASNE, PATNE, AGRND, ASRND, 
PATNQ, KYDSC, KYEGK, HRDKN, PFNGD, LANKD, RGSKD, 
TYDGC, RSNRD, RANRN, KNEGK, KNNGE, PASNE, and PATNE. 
10 3. Mutant receptors selected from the group 

consisting of GTG8, GTG3A, GTG2, GTG1. 

4. A method for constructing a receptor that 
can activate both GRE and ERE sequences, said method 
comprising introducing a point mutation into: (A) the 

15 glucocorticoid receptor DNA sequence so that the 
encoded glycine at the site between Cj and C 4 is 
replaced with a glutamic acid, or (B) the estrogen 
receptor DNA sequence so that the encoded glutamic 
acid at the site between C3 and C 4 is replaced with a 

20 glycine. 

5. A substantially pure receptor protein that 
can activate both glucocorticoid and estrogen 
response elements. 

6. A substantially pure receptor protein 

25 according to Claim 5 wherein said protein is produced 
by the method of Claim 4. 

7. A method for identifying functional 
ligand(s) for an orphan receptor protein, said method 
comprising: (A) isolating DNA sequences suspected of 

30 encoding receptor protein; (B) transfecting into a 
suitable receptor-deficient host cell: (1) the DNA 
sequences from step A, and (2) at least one reporter 
gen functionally linked to at least one operative 
hormone response element wherein said hormone 

35 response lement(s) is s 1 cted from the group 
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consisting of wild-type, engineered or synthetic • 
glucocorticoid response element and wild-type, 
engineered or synthetic estrogen response element; 
(C) challenging the transfected host cell from step B 
5 with candidate ligand(s) which can potentially bind 
with the ligand^binding domain region of the orphan 
receptor protein encoded by the DNA sequence of step 
A; and (D) monitoring induction of the reporter 
gene(s) . 

10 8. A method for identifying functional 

ligand(s) for an orphan receptor protein, said method 
comprising: (A) isolating DNA sequences suspected of 
encoding receptor proteins; (B) transfecting into a 
suitable receptor-deficient host cell: (1) the DNA 

15 sequences from step A, (2) at least one reporter gene 
functionally linked to at least one operative hormone 
responsive element wherein the hormone response 
element (s) is selected from the group consisting of 
wild-type, engineered or synthetic glucocorticoid 

20 response element, and (3) at least one reporter gene 
functionally linked to at least one operative hormone 
response element wherein the hormone response 
element (s) is selected from the group consisting of 
wild-type, engineered or synthetic estrogen response . 

25 element; (C) challenging the transfected host cell 
from step B with candidate ligand(s) which can 
potentially bind with the ligand-binding domain 
region of the orphan receptor protein encoded by the 
DNA sequence of step A; and (D) monitoring induction 

30 of the reporter gene(s) . 

9. A method for identifying functional 
ligand(s) for an orphan receptor protein (ORP) , said 
method comprising: (A) identifying at least one 
functional hormone response element that is activated 

35 by th orphan r ceptor protein (ORP) ; (B) contacting 
said orphan receptor protein (ORP) in receptor- 
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deficient host cells with at least one reporter gen 
operatively linked to said functional hormone 
response element identified in step A; chall nging 
said host cells from step B with candidate ligand(s) 
5 which can potentially bind with the ligand-binding 
domain region of said orphan receptor protein (ORP) ; 
and (D) monitoring induction of the reporter gene(s). 

10. A method according to Claim 9 A wherein 
said functional hormone response element is 

10 identified by (A) determining the amino acid sequence 
of at least the hormone response element determining 
portion of the DNA binding domain of said orphan 
receptor protein, and (B) identifying a hormone 
response element that is activated by the amino acids 

15 determined in step A. 

11. A method for identifying functional 
ligand(s) for an orphan receptor protein in a cell 
wherein said cell contains (A) a DNA sequence 
suspected of encoding an orphan receptor protein, (B) 

20 at least one reporter nucleic acid sequence 

functionally linked to an operative hormone response 
element wherein the hormone response element is 
selected from the group consisting of wild-type, 
engineered or synthetic glucocorticoid response 

25 element, and (C) at least one reporter nucleic acid 
sequence functionally linked to an operative hormone 
response element wherein the hormone response element 
is selected from the group consisting of wild-type, 
engineered or synthetic estrogen response element; 

30 said method comprising challenging the receptor- 
deficient cell with at least one candidate ligand and 
monitoring induction of the reporter nucleic acid 
sequence (s) • 

12. A method of any of Claims 7-11 wherein 
35 said cell is a CV-1 cell. 
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13 . A method according to any of Claims 7-11 
wherein the reporter gene is sel cted from the group 
consisting of a chlorampfa nicol acetyltransferase 
(CAT) gene and a firefly lucif erase gene. 

14. A method for identifying a ligand(s) that 
activates an orphan receptor comprising: (A) 
providing in receptor-deficient host cells at least 
one reporter gene functionally linked to a 
preselected hormone response element; (B) mutating at 
least the DNA encoding the P region of the DNA 
binding domain of the orphan receptor so that the 
mutated orphan receptor thus produced can activate 
the preselected hormone response element from step A; 
(C) contacting receptor-deficient cells from step A 
with mutated orphan receptor from step B; (D) 
challenging said cells from step C with candidate 
ligand(s) which can potentially bind with the ligand- 
binding domain region of said mutated orphan 
receptor; and (E) monitoring induction of the 
reporter gene(s) . 

15. A method according to Claim 14 B wherein 
said P region is mutated by sight directed 
mutagenesis . 
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AMENDED CLAIMS 

o^-S by n the ^^a^al Bureau on 17 September 1990 (17.09.90) 
original claims 7 and 8 amended; other claims unchanged (2 pages)] 

1. Element P amino acids selected from the 
group consisting of GSCKV, EGCKA, EGCKG, EGCKS, and 

EACKA. 

2. Element D amino acids selected from the 
5 group consisting of AGRND, ASRND, PATNQ, KYDSC, 

KYEGK, HRDKN, PFNGD, LANKD, RGSKD, TYDGC, RSNRD, 
RANRN, KNEGK, KNNGE, PASNE, PATNE, AGRND, ASRND, 
PATNQ, KYDSC, KYEGK, HRDKN, PFNGD, LANKD, RGSKD, 
TYDGC, RSNRD, RANRN, KNEGK, KNNGE, PASNE, and PATNE. 
10 3 « Mutant receptors selected from the group 

consisting of GTG8 , GTG3A, GTG2 , GTG1. 

4. A method for constructing a receptor that 
can activate both GRE and ERE sequences, said method 
comprising introducing a point mutation into: (A) the 

15 glucocorticoid receptor DNA sequence so that the 
encoded glycine at the site between C3 and C< is 
replaced with a glutamic acid, or (B) the estrogen 
receptor DNA sequence so that the encoded glutamic 
acid at the site between Cj and C A is replaced with a 

20 glycine. 

5. A substantially pure receptor protein that 
can activate both glucocorticoid and estrogen 
response elements. 

6. A substantially pure receptor protein 
according to Claim 5 wherein said protein is produced 
by the method of Claim 4. 

7. A method for identifying functional 
ligand(s) for an orphan receptor protein, said method 
comprising: (A) isolating DNA sequences suspected of 

30 encoding receptor protein; (B) introducing into a 
suitable receptor-deficient host cell: (l) the DNA 
sequences from step A, and (2) at least one reporter 
gene functionally linked to at least one operative 
hormone response element wherein said hormone 
35 response element (s) is .selected from the group 
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consisting of wild-type, engineered or synthetic 
glucocorticoid response element and wild-type, 
engineered or synthetic estrogen response element; 
(C) challenging the transfected host cell from step B 
5 with candidate ligand(s) which can potentially bind 
with the ligand-binding domain region of the orphan 
receptor protein encoded by the DNA sequence of step 
A; and (D) monitoring induction of the reporter 
gene(s) . 

10 8. A method for identifying functional 

ligand(s) for an orphan receptor protein, said method 
comprising: (A) isolating DNA sequences suspected of 
encoding receptor proteins; (B) introducing into a 
suitable receptor-deficient host cell: (1) the DNA • 
15 sequences from step A, (2) at least one reporter gene 
functionally linked to at least one operative hormone 
responsive element wherein the hormone response 
element (s) is selected from the group consisting of 
wild-type, engineered or synthetic glucocorticoid 
20 response element, and (3) at least one reporter gene 
functionally linked to at least one operative hormone 
response element wherein the hormone response 
element (s) is selected from the group consisting of 
wild-type, engineered or synthetic estrogen response 
25 element; (C) challenging the transfected host cell 
from step B with candidate ligand(s) which can 
potentially bind with the ligand-binding domain 
region of the orphan receptor protein encoded by the 
DNA sequence of step A; and (D) monitoring induction 
30 of the reporter gene(s) . 

9. A method for identifying functional 
ligand(s) for an orphan receptor protein (ORP) , said 
method comprising: (A) identifying at least one 
functional hormone response element that is activated 
35 by th orphan receptor protein (ORP) ; (B) contacting 
said orphan receptor protein (ORP) in receptor- 
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